The Normalization of Occurrence and Co-occurrence Matrices in Bibliometrics using Cosine Similarities and Ochiai Coefficients

Authors

  • Qiuju Zhou
  • Loet Leydesdorff
Abstract

We prove that the Ochiai similarity of the co-occurrence matrix is equal to the cosine similarity in the underlying occurrence matrix. Neither the cosine nor the Pearson correlation should be used for the normalization of co-occurrence matrices because the similarity is then normalized twice, and therefore over-estimated; the Ochiai coefficient can be used instead. Results are shown using a small matrix (5 cases, 4 variables) for didactic reasons, and also Ahlgren et al.’s (2003) co-occurrence matrix of 24 authors in library and information sciences. The over-estimation is shown numerically and illustrated using multidimensional scaling and cluster dendrograms. If the occurrence matrix is not available (such as in internet research or author co-citation analysis), using Ochiai for the normalization is preferable to using the cosine.
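The equality stated in the abstract can be checked numerically on a toy example. The sketch below is only an illustration, not the authors' computation: the 5 x 4 binary occurrence matrix is hypothetical (it is not the matrix used in the paper), and the helper names `cooccurrence`, `ochiai`, and `cosine_matrix` are chosen here for readability. It assumes the co-occurrence matrix is formed as the product of the transposed occurrence matrix with itself, with occurrence totals on the diagonal.

```python
import math

# Hypothetical binary occurrence matrix: 5 cases (rows) x 4 variables (columns).
# Illustrative values only; these are not the data used in the paper.
occurrence = [
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
]


def columns(matrix):
    """Return the columns of a row-major matrix as lists."""
    return [list(col) for col in zip(*matrix)]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cooccurrence(matrix):
    """Co-occurrence matrix C = A^T A; C[i][i] is the total of variable i."""
    cols = columns(matrix)
    return [[dot(ci, cj) for cj in cols] for ci in cols]


def ochiai(cooc):
    """Ochiai coefficient c_ij / sqrt(c_ii * c_jj) from co-occurrence counts."""
    n = len(cooc)
    return [
        [cooc[i][j] / math.sqrt(cooc[i][i] * cooc[j][j]) for j in range(n)]
        for i in range(n)
    ]


def cosine_matrix(matrix):
    """Cosine similarity between every pair of columns of a matrix."""
    cols = columns(matrix)
    return [
        [dot(ci, cj) / (math.sqrt(dot(ci, ci)) * math.sqrt(dot(cj, cj)))
         for cj in cols]
        for ci in cols
    ]


cooc = cooccurrence(occurrence)
cos_occurrence = cosine_matrix(occurrence)  # cosine on the occurrence matrix
ochiai_cooc = ochiai(cooc)                  # Ochiai on the co-occurrence matrix
cos_cooc = cosine_matrix(cooc)              # cosine applied to the co-occurrence matrix

for i in range(len(cooc)):
    for j in range(i + 1, len(cooc)):
        print(i, j,
              round(cos_occurrence[i][j], 3),
              round(ochiai_cooc[i][j], 3),
              round(cos_cooc[i][j], 3))
```

For each pair of variables, the first two printed values agree, while the third (the cosine computed on the already aggregated co-occurrence counts) generally differs; for the first pair of variables in this toy matrix it is larger, in line with the over-estimation the abstract describes.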

Similar resources

Image Steganalysis Based on Co-Occurrences of Integer Wavelet Coefficients

We present a steganalysis scheme for LSB matching steganography based on feature vectors extracted from integer wavelet transform (IWT). In integer wavelet decomposition of an image, the coefficients will be integer, so we can calculate co-occurrence matrix of them without rounding the coefficients. Before calculation of co-occurrence matrices, we clip some of the most significant bitplanes of ...

Severity and Co-occurrence of Oral and Verbal Apraxias in Left Brain Damaged Adults

Objective: Oral and verbal apraxias represent motor programming deficits of nonverbal and verbal movements, respectively. Studying their properties may shed light on speech motor control processes. This study focused on identifying cases with oral or verbal apraxia, their co-occurrences, and their severities. Materials & Methods: In this non-experimental study, 55 left adult subjects with left b...

A Century of Internal Auditing -Using Computational Literature Review

A review of the internal audit literature over the past century reveals a significant topic variety and thematic diversity, so that a systematic review of these studies is necessary in order to gain a deeper understanding of internal audit research. This research aims to fill the gap in previous review research and in response to the recommendation of Behrend & Eulerich (2019) and aims to ident...

Drawing Word co-occurrence map of Spinal Muscular Atrophy disease

Introduction: The purpose of this article is to evaluate the status of articles in the field of Spinal Muscular Atrophy according to scientometric indices and the word co-occurrence map of this field. Methods: The present study is an applied one with a quantitative and descriptive approach. It was conducted using scientometrics and the word co-occurrence analysis technique. Document...

Incremental Cosine Computations for Search and Exploration of Tag Spaces

Tags are often used to describe user-generated content on the Web. However, the available Web applications are not incrementally dealing with new tag information, which negatively influences their scalability. Since the cosine similarity between tags represented as co-occurrence vectors is an important aspect of these frameworks, we propose two approaches for an incremental computation of cosin...

Journal:
  • JASIST

Volume 67  Issue 

Pages  -

Publication date: 2016